troubleshoot

N-Gram Pro

Corpus Analysis

info
Text to analyze. Paste directly or import a CSV/TXT/Excel file.
N-Gram Size (N) info
The number of consecutive words/characters to group together. Unigram (1), Bigram (2), etc.

Remove Stopwords info
Enable to filter out common, low-value words (e.g., 'the', 'a', 'is', '的', '是') to focus on more meaningful terms.
Case Sensitive info
Enable to treat uppercase and lowercase letters as distinct. If disabled, 'Apple' and 'apple' will be treated as the same word.

Display Limits
The number of most frequent N-Grams to display in the results table.
The number of least frequent N-Grams to display. Useful for finding rare combinations.
Total Tokens info
0
The total number of words (or characters for Chinese) in the source data after processing.
Total N-Grams info
0
The total number of N-Gram instances generated from the source data.
Unique Combinations info
0
category
The number of distinct N-Gram combinations found.
Rank info
The frequency ranking of the N-Gram.
N-Gram info
The sequence of N words/characters.
Count info
The total number of times this N-Gram appears in the text.
% Share info
The percentage of this N-Gram relative to the total number of N-Grams.
database

No Data Available